NITRYL HYDRATASE GENE AND AMIDASE GENE FROM 
RHODOCOCCUS BACTERIUM, AND PROCESS FOR PRODUCING 
CARBOXYUC ACIDS USING THEM 

•June 26, 2001 
E. Kramer. 202-293-7060 
Sequence Listing 



09/869142 

SBQUENCE LISTING JC03 RecM PCT/PTC 2 6 JUN 2001 



<110> SHOWA DENKO K.K. 



<120> NOVEL RHODOCOCCUS BACTERIA, NITRILASE GENE, NITRYL HYDRATASE GENE AND 
AMIDASE GENE FROM RHODOCOCCUS BACTERIUM, AND PROCESS FOR PRODUCING CARBOXYLIC 



ACIDS 


USING THEM 


<130> 


Q64574 


<150> 
<151> 


USSN 60/183,754 
2000-02-22 


<150> 
<151> 


USSN 60/183, 821 
2000-02-22 


<150> 
<151> 


JPA 2000-107855 
2000-04-10 


<150> 
<151> 


JPA 2000-021797 
2000-01-26 


<150> 
<151> 


JPA 11-303212 
1999-10-26 


<160> 


7 


<170> 


Patentln version 


<210> 
<211> 
<212> 
<213> 


1 

1531 
DNA 

Rhodococcus sp . 


<220> 
<221> 
<222> 
<223> 


exon 

(324) . . (1421) 


<400> 


1 



agcttgacca tgattacgaa ttcgagctcg gtacccgggg atcgaaccag caaeggggae 60 

gcacagtcga cgtagacctc gacctatccg ccgttccgca gaaggacacc gaccaccacc 120 

acttcaacat ccttcaacgt gcccggccag tccttcgacg aatcgaaacg gegaagagee 180 

gcctcggacc ccccggccga accgctcgat gaactcccct acacgggtgg cgcagaatgc 240 

caggacccgt gtcattccac gtcaattcac gcgccttttc acctcgtact gtcctgccaa 300 



acacaagcaa eggaggtacg gac atg gtc gaa tac aca aac aca ttc aaa gtt 

Met Val Glu Tyr Thr Asn Thr Phe Lys Val 
15 10 



353 



get gcg gtg cag gca cag cct gtg tgg ttc gac gcg gec aaa acg gtc 401 

Ala Ala Val Gin Ala Gin Pro Val Trp Phe Asp Ala Ala Lys Thr Val 
15 20 25 

gac aag acc gtg tec ate ate gcg gaa gca gee egg aac ggg tgc gag 449 
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KTTRYL HYDRATASE GENE AND AMID AS E GENE FROM 
RHODOCOCCUS BACTERIUM, AND PROCESS FOR PRODUCING 
CARBOXYUC ACIDS USING THEM 
Fil^^mc 26, 2001 

202-293-7060 

Sequence Listing 

* -* 

Asp Lys Thr Val Ser He He Ala Glu Ala Ala Arg Asn Gly Cys Glu 
30 35 40 

etc gtt gcg ttt ccc gag gta ttc ate ccg ggg tac ccg tac cac ate 4 97 

Leu Val Ala Phe Pro Glu Val Phe He Pro Gly Tyr Pro Tyr His He 
45 50 55 



tgg gtc gac age ccg etc gec gga atg gcg aag ttc gee gtg cgc tac 
Trp Val Asp Ser Pro Leu Ala Gly Met Ala Lys Phe Ala Val Arg Tyr 
60 65 70 

cac gag aat tec ctg acg atg gac age ccg cac gta cag egg ttg etc 
His Glu Asn Ser Leu Thr Met Asp Ser Pro His Val Gin Arg Leu Leu 
75 80 85 90 

gat gee gee cgc gac cac aac ate gee gta gtg gtg gga ate age gag 
Asp Ala Ala Arg Asp His Asn He Ala Val Val Val Gly He Ser Glu 
95 100 105 

egg gat ggc ggc age ttg tac atg acc cag etc ate ate gac gec gat 
Arg Asp Gly Gly Ser Leu Tyr Met Thr Gin Leu He He Asp Ala Asp 
110 115 120 

ggg caa ctg gtc gee cga cgc cgc aag etc aag ccc acc cac gtc gag 
Gly Gin Leu Val Ala Arg Arg Arg Lys Leu Lys Pro Thr His Val Glu 
125 130 135 

cgt teg gta tac gga gaa gga aac ggc teg gat ate tec gtg tac gac 
Arg Ser Val Tyr Gly Glu Gly Asn Gly Ser Asp He Ser Val Tyr Asp 
140 145 150 

atg cct ttc gca egg ctt ggc gcg etc aac tgc tgg gag cat ttc cag 
Met Pro Phe Ala Arg Leu Gly Ala Leu Asn Cys Trp Glu His Phe Gin 
155 160 165 170 

acg etc acc aag tac gca atg tac teg atg cac gag cag gtg cac gtc 
Thr Leu Thr Lys Tyr Ala Met Tyr Ser Met His Glu Gin Val His Val 
175 180 185 

gcg age tgg cct ggc atg teg ctg tac cag ccg gag gtc ccc gca ttc 
Ala Ser Trp Pro Gly Met Ser Leu Tyr Gin Pro Glu Val Pro Ala Phe 
190 195 200 

ggt gtc gat gee cag etc acg gec acg cgt atg tac gca etc gag gga 
Gly Val Asp Ala Gin Leu Thr Ala Thr Arg Met Tyr Ala Leu Glu Gly 
205 210 215 

caa acc ttc gtg gtc tgc acc acc cag gtg gtc aca ccg gag gee cac 
Gin Thr Phe Val Val Cys Thr Thr Gin Val Val Thr Pro Glu Ala His 
220 225 230 

gag ttc ttc tgc gag aac gag gaa cag cga atg ttg ate ggc cga ggc 
Glu Phe Phe Cys Glu Asn Glu Glu Gin Arg Met Leu He Gly Arg Gly 
235 240 245 250 

gga ggt ttc gcg cgc ate ate ggg ccc gac ggc cgc gat etc gca act 
Gly Gly Phe Ala Arg He He Gly Pro Asp Gly Arg Asp Leu Ala Thr 
255 260 265 

cct etc gee gaa gat gag gag ggg ate etc tac gee gac ate gat ctg 
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NITRYL HYDRATASE GENE AND AMID AS E GENE FROM 
RHODOCOCCUS BACTERIUM, AND PROCESS FOR PRODUCING 
CARBOXYUC ACIDS USING THEM 
FHj^Jtme 26, 2001 

Kramer. 202-293-7060 

^^^H[ Sequence Listing 

Pro Leu Ala Glu Asp Glu Glu Gly lie Leu Tyr Ala Asp lie Asp Leu 
270 275 280 

tct gcg ate acc ttg gcg aag cag gec get gac ccc gtg ggc cac tac 
Ser Ala He Thr Leu Ala Lys Gin Ala Ala Asp Pro Val Gly His Tyr 
285 290 295 

tea egg ccg gat gtg ctg teg ctg aac ttc aac cag cgc cgc acc acg 
Ser Arg Pro Asp Val Leu Ser Leu Asn Phe Asn Gin Arg Arg Thr Thr 
300 305 310 



<210> 2 
<211> 366 
<212> PRT 

<213> Rhodococcus sp. 
<400> 2 

Met Val Glu Tyr Thr Asn Thr Phe Lys Val Ala Ala Val Gin Ala Gin 
15 10 15 



Pro Val Trp Phe Asp Ala Ala Lys Thr Val Asp Lys Thr Val Ser He 
20 25 30 



He Ala Glu Ala Ala Arg Asn Gly Cys Glu Leu Val Ala Phe Pro Glu 
35 40 45 



Val Phe He Pro Gly Tyr Pro Tyr His He Trp Val Asp Ser Pro Leu 
50 55 60 

Ala Gly Met Ala Lys Phe Ala Val Arg Tyr His Glu Asn Ser Leu Thr 
65 70 75 80 



1217 



1265 



1409 



ccc gtc aac acc cca ctt tec acc ate cat gee acg cac acg ttc gtg 1313 
Pro Val Asn Thr Pro Leu Se.r Thr He His Ala Thr His Thr Phe Val 
315 320 325 330 

ccg cag ttc ggg gca etc gac ggc gtc cgt gag etc aac gga gcg gac 13 61 

Pro Gin Phe Gly Ala Leu Asp Gly Val Arg Glu Leu Asn Gly Ala Asp 
335 340 345 

gaa cag cgc gca ttg ccc tec aca cat tec gac gag acg gac egg gcg 
Glu Gin Arg Ala Leu Pro Ser Thr His Ser Asp Glu Thr Asp Arg Ala 
350 355 360 

aca gec acc etc tgacteggge gcacccgtgg cgcctccgaa gcgccacggg 1461 
Thr Ala Thr Leu 
365 

tgtgtgaagg ggcgagacag gggaatcgga ggatccccgg gtaccgagct egaattegta 1521 
atcatggtca 1531 



Met Asp Ser Pro His Val Gin Arg Leu Leu Asp Ala Ala Arg Asp His 
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Sequence Listing '-^^^^V 

85 90 95 



Asn lie Ala Val Val Val Gly He Ser Glu Arg Asp Gly Gly Ser Leu 
100 105 HO 



Tyr Met Thr Gin Leu He He Asp Ala Asp Gly Gin Leu Val Ala Arg 
115 120 125 



Arg Arg Lys Leu Lys Pro Thr His Val Glu Arg Ser Val Tyr Gly Glu 
130 135 140 



Gly Asn Gly Ser Asp He Ser Val Tyr Asp Met Pro Phe Ala Arg Leu 
145 150 155 160 



Gly Ala Leu Asn Cys Trp Glu His Phe Gin Thr Leu Thr Lys Tyr Ala 
165 170 175 



Met Tyr Ser Met His Glu Gin Val His Val Ala Ser Trp Pro Gly Met 
180 185 190 



Ser Leu Tyr Gin Pro Glu Val Pro Ala Phe Gly Val Asp Ala Gin Leu 
195 200 205 



Thr Ala Thr Arg Met Tyr Ala Leu Glu Gly Gin Thr Phe Val Val Cys 
210 215 220 



Thr Thr Gin Val Val Thr Pro Glu Ala His Glu Phe Phe Cys Glu Asn 
225 230 235 240 



Glu Glu Gin Arg Met Leu He Gly Arg Gly Gly Gly Phe Ala Arg He 
245 250 255 



He Gly Pro Asp Gly Arg Asp Leu Ala Thr Pro Leu Ala Glu Asp Glu 
260 265 270 



Glu Gly He Leu Tyr Ala Asp He Asp Leu Ser Ala He Thr Leu Ala 
275 280 285 



Lys Gin Ala Ala Asp Pro Val Gly His Tyr Ser Arg Pro Asp Val Leu 
290 295 300 



Ser Leu Asn Phe Asn Gin Arg Arg Thr Thr Pro Val Asn Thr Pro Leu 
305 310 315 320 



Ser Thr He His Ala Thr His Thr Phe Val Pro Gin Phe Gly Ala Leu 
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325 330 335 

Asp Gly Val Arg Glu Leu Asn Gly Ala Asp Glu Gin Arg Ala Leu Pro 
340 345 350 

Ser Thr His Ser Asp Glu Thr Asp Arg Ala Thr Ala Thr Leu 
355 360 365 

<210> 3 
<211> 2822 
<212> DNA 

<213 > Rhodococcus sp . 
<220> 

<221> CDS 

<222> (1379) . . (2068) 

<223> nitrile hydratase beta subunit 
<220> 

<221> CDS 

<222> (2082) . . (2693) 

<223> nitrile hydratase alpha subunit 
<400> 3 

ctagaggatc teggtcateg cgataccatc gttgcggacg atgatgtcca ataegtacca 60 

ctggtccgcg gtcaacttct cttgatcgac cacgttatgg attctacgac tcagggaccg 120 

gctcacggct tccagggcgc ctccgaccaa aggtgatcga acgacatttc eggattcage 180 

caccgcttcc gactcgatca ttcctgtccc tccccgtcca cgcgcagttg atcttacctc 240 

ctcatcaaga ggatatccac tgaacgaatt atttcaagtg gaagtacttg gagtcgatcc 300 

tacaegtgag tggacgatgc ctgggcgcta gtcggatgtg caacccaccc accccctcct 3 60 

cccgcctacg ccgaagaccg gaaccggcgt cgtccctgcc tgccgtctct ggcaactgtt 420 

gtgaacgccc gagcggccct cacggctctt cagttggcgc ggatcgecat ggcggacgtc 480 

gcccacggcg ggacctacgc atcttcggcc ggaaggcagc cgcggtcacg aacacctagc 540 

ggcagtcgag cacctgagac gaaggccgcc ggcgtcctgt cccggaaatc cgcagcccag 600 

ccgtgacagc caacagtcgt ggcggttccc tcccctccta gggtctttga ctcggcgcca 660 

acgcctgcga gggcgctcgt cgcggaccac ttgtcgaggt cggtgccgca cgtcaccgag 720 

cgcacccttc ttcgtgctct gcgcatcggc ccggaccgcg accgcggcaa cactacgacg 780 

tctgacaatg ctgatcccct gccgccgccg ttggacgacc acagttgeta egagcatgeg 840 

gagccaacca taggcatcat gcgatcgccg gagtcttcat cctattttgg gatgegcagg 900 

attaacacat ctacacattg acatccgttc cgatgtgaag taaaaattgt caegtaggge 960 
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JVU v KHODOCOCCUS BACTERIUM, MTRILASE GENE, 
N1TRYL HYDRA TASE GENE AND AMID AS E GENE FROM 

■^SSX^ PR0CESS TOR PR0DUCTNG 
kmc 26, 2001 
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Sequence Listing 

ggcaggcgaa gtctgeaget cgaacatcga agggtgggag ccgagagatc ggagacgcag 102 0 

acacccggag ggaacttagc ctcccggacc gatgcgtgtc ctggcaacgc ctcaagattc 1080 

agegcaageg attcaatctt gttacttcca gaaccgaatc acgtccccgt agtgtgcggg 1140 

gagagcgccc gaacgcaggg atggtatcca tgcgcccctt ctcttttcga acgagaaccg 12 00 

geeggtacag teaatcegga cacattgtga cgccgttcaa cgattgttgt gctgtgaagg 1260 

attcactcaa gecaactgat atcgccattc cgttgccgga acatttgacg ccttctccct 1320 

acgagtagaa gccagctgga ccctctttga gcccagctcc gatgaaagga atgaggaa 1378 

atg gat ggt ate cac gac aca ggc ggc atg acc gga tac gga ccg gtc 1426 
Met Asp Gly lie His Asp Thr Gly Gly Met Thr Gly Tyr Gly Pro Val 
15 10 15 

ccc tat cag aag gac gag ccc ttc ttc cac tac gag tgg gag ggt cga 1474 
Pro Tyr Gin Lys Asp Glu Pro Phe Phe His Tyr Glu Trp Glu Gly Arg 
20 25 30 

acc ctg teg att ctg acc tgg atg cat etc aag ggc atg teg tgg tgg 1522 
Thr Leu Ser lie Leu Thr Trp Met His Leu Lys Gly Met Ser Trp Trp 
35 40 45 

gac aag teg egg ttc ttc egg gag teg atg ggg aac gaa aac tac gtc 1570 
Asp Lys Ser Arg Phe Phe Arg Glu Ser Met Gly Asn Glu Asn Tyr Val 
50 55 60 

aac gag att cgc aac teg tac tac acc cac tgg ctg agt gcg gcg gaa 1618 
Asn Glu lie Arg Asn Ser Tyr Tyr Thr His Trp Leu Ser Ala Ala Glu 
65 70 75 80 

cgt ate etc gtc gee gac aag ate ate acc gaa gaa gag cga aag cac 1666 
Arg lie Leu Val Ala Asp Lys lie lie Thr Glu Glu Glu Arg Lys His 
85 90 95 

cgc gtg cag gag ate etc gag ggt egg tac acg gac agg aac ccg teg 1714 
Arg Val Gin Glu lie Leu Glu Gly Arg Tyr Thr Asp Arg Asn Pro Ser 
100 105 110 

egg aag ttc gat ccg gec gag ate gag aag gcg ate gag agg ctt cac 1762 
Arg Lys Phe Asp Pro Ala Glu lie Glu Lys Ala lie Glu Arg Leu His 
115 120 125 

gag ccc cac tec eta gtg ctt cca gga gcg gag ccg agt ttc tec etc 1810 
Glu Pro His Ser Leu Val Leu Pro Gly Ala Glu Pro Ser Phe Ser Leu 
130 135 140 

ggt gac aag gtc aaa gtg aag aac atg aac ccg ctg gga cac aca egg 1858 
Gly Asp Lys Val Lys Val Lys Asn Met Asn Pro Leu Gly His Thr Arg 
145 150 155 160 

tgc ccg aag tat gtg egg aac aga ate ggg gaa ate gtc acc tec cac 1906 
Cys Pro Lys Tyr Val Arg Asn Arg lie Gly Glu lie Val Thr Ser His 
165 170 175 



ggg tgc cag ate tat ccc gag age age tec gec ggc etc ggc gac gat 1954 
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NITRYL HYDRATASE GENE AND AMID AS E GENE FROM 
RHODOCOCCUS BACTERIUM. AND PROCESS FOR PRODUCING 
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BMLnrr 202-293-7060 

7 j^^^Hf Sequence Listing 



Gly Cys Gin He Tyr Pro Glu Ser Ser Ser Ala Gly Leu Gly Asp Asp 
180 185 190 

ccc cgc ccg etc tac acg gtc gcg ttt tec gee cag gaa ctg tgg ggc 
Pro Arg Pro Leu Tyr Thr Val Ala Phe Ser Ala Gin Glu Leu Trp Gly 
195 200 205 

gac gac gga aac ggg aaa gac gta gtg tgc gtc gat etc tgg gaa ccg 
Asp Asp Gly Asn Gly Lys Asp Val Val Cys Val Asp Leu Trp Glu Pro 
210 215 220 

tac ctg ate tct gcg tga aaggaatacg ata gtg age gag cac gtc aat 
Tyr Leu He Ser Ala Val Ser Glu His Val Asn 

225 230 235 

aag tac acg gag tac gag gca cgt acc aag gca ate gaa acc ttg ctg 
Lys Tyr Thr Glu Tyr Glu Ala Arg Thr Lys Ala He Glu Thr Leu Leu 
240 245 250 

tac gag cga ggg etc ate acg ccc gec gcg gtc gac cga gtc gtt teg 
Tyr Glu Arg Gly Leu He Thr Pro Ala Ala Val Asp Arg Val Val Ser 
255 260 265 

tac tac gag aac gag ate ggc ccg atg ggc ggt gee aag gtc gtg gee 
Tyr Tyr Glu Asn Glu He Gly Pro Met Gly Gly Ala Lys Val Val Ala 
270 275 280 

aag tec tgg gtg gac cct gag tac cgc aag tgg etc gaa gaa gac gcg 
Lys Ser Trp Val Asp Pro Glu Tyr Arg Lys Trp Leu Glu Glu Asp Ala 
285 290 295 

acg gee gcg atg gcg tea ttg ggc tat gec ggc gag cag gca cac cag 
Thr Ala Ala Met Ala Ser Leu Gly Tyr Ala Gly Glu Gin Ala His Gin 
300 305 310 315 

ate teg gee gtc ttc aac gac tec caa aca cat cac gta gtg gtg tgc 
He Ser Ala Val Phe Asn Asp Ser Gin Thr His His Val Val Val Cys 
320 325 330 

act ctg tgt teg tgc tat ccg tgg ccg gtg ctt ggc etc ccg ccc gee 
Thr Leu Cys Ser Cys Tyr Pro Trp Pro Val Leu Gly Leu Pro Pro Ala 
335 340 345 

tgg tac aag age atg gag tac egg tec cga gtg gta gca gac cct cgt 2483 
Trp Tyr Lys Ser Met Glu Tyr Arg Ser Arg Val Val Ala Asp Pro Arg 
350 355 360 



gga gta etc aag cgc gat ttc ggg ttc gac ate ccc gat gag gtg gag 
Gly Val Leu Lys Arg Asp Phe Gly Phe Asp He Pro Asp Glu Val Glu 
365 370 375 

gtc agg gtt tgg gac age age tec gaa ate cgc tac ate gtc ate ccg 
Val Arg Val Trp Asp Ser Ser Ser Glu He Arg Tyr He Val He Pro 
380 385 390 395 

gaa egg ccg gee ggc acc gac ggt tgg tec gag gac gag ctg gcg aag 
Glu Arg Pro Ala Gly Thr Asp Gly Trp Ser Glu Asp Glu Leu Ala Lys 
400 405 410 

ctg gtg agt egg gac teg atg ate ggt gtc agt aat gcg etc aca ccg 
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NTTRYL HYDRATASE GENE AND AMIDASE GENE FROM 
RHODOCOCCUS BACTERIUM, AND PROCESS FOR PRODUCING 
CARBOXYUC ACIDS USING THEM 
FH«LJuBe26 f 2001 

„...202-293-7060 

Sequence Listing 



CARBOXYUC A 
F^j|^e2^20 



aagcaaegge atteggggtc gccatcgcgc ttteggate 



<210> 4 
<211> 229 
<212> PRT 

<213> Rhodococcus sp . 
<400> 4 

Met Asp Gly He His Asp Thr Gly Gly Met Thr Gly Tyr Gly Pro Val 
15 10 15 



Pro Tyr Gin Lys Asp Glu Pro Phe Phe His Tyr Glu Trp Glu Gly Arg 
20 25 30 



Thr Leu Ser He Leu Thr Trp Met His Leu Lys Gly Met Ser Trp Trp 
35 40 45 

Asp Lys Ser Arg Phe Phe Arg Glu Ser Met Gly Asn Glu Asn Tyr Val 
50 55 60 



Asn Glu He Arg Asn Ser Tyr Tyr Thr His Trp Leu Ser Ala Ala Glu 
65 70 75 80 

Arq He Leu Val Ala Asp Lys He He Thr Glu Glu Glu Arg Lys His 
85 90 95 



Arg Val Gin Glu He Leu Glu Gly Arg Tyr Thr Asp Arg Asn Pro Ser 
100 105 HO 



Arg Lys Phe Asp Pro Ala Glu He Glu Lys Ala He Glu Arg Leu His 
115 120 125 



Glu Pro His Ser Leu Val Leu Pro Gly Ala Glu Pro Ser Phe Ser Leu 
130 135 140 

Gly Asp Lys Val Lys Val Lys Asn Met Asn Pro Leu Gly His Thr Arg 
145 150 155 160 



2723 



Leu Val Ser Arg Asp Ser Met He Gly Val Ser Asn Ala Leu Thr Pro 
415 420 425 

cag gaa gtg ate gta tga gtgaagacac actcactgat cggctcccgg 
Gin Glu Val He Val 
430 

cgactgggac cgccgcaccg ccccgcgaca atggcgagct tgtattcacc gagccttggg 2783 



2822 



Cys Pro Lys Tyr Val Arg Asn Arg He Gly Glu He Val Thr Ser His 
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165 170 175 



Gly Cys Gin He Tyr Pro Glu Ser Ser Ser Ala Gly Leu Gly Asp Asp 
180 185 190 



Pro Arg Pro Leu Tyr Thr Val Ala Phe Ser Ala Gin Glu Leu Trp Gly 
195 200 205 



Asp Asp Gly Asn Gly Lys Asp Val Val Cys Val Asp Leu Trp Glu Pro 
210 215 220 



Tyr Leu He Ser Ala 
225 



<210> 5 

<211> 203 

<212> PRT 

<213> Rhodococcus sp . 

<400> 5 

Val Ser Glu His Val Asn Lys Tyr Thr Glu Tyr Glu Ala Arg Thr Lys 
1 5 10 15 



Ala He Glu Thr Leu Leu Tyr Glu Arg Gly Leu He Thr Pro Ala Ala 
20 25 30 



Val Asp Arg Val Val Ser Tyr Tyr Glu Asn Glu lie Gly Pro Met Gly 
35 40 45 



Gly Ala Lys Val Val Ala Lys Ser Trp Val Asp Pro Glu Tyr Arg Lys 
50 55 60 



Trp Leu Glu Glu Asp Ala Thr Ala Ala Met Ala Ser Leu Gly Tyr Ala 
65 70 75 80 



Gly Glu Gin Ala His Gin He Ser Ala Val Phe Asn Asp Ser Gin Thr 
85 90 95 



His His Val Val Val Cys Thr Leu Cys Ser Cys Tyr Pro Trp Pro Val 
100 105 110 



Leu Gly Leu Pro Pro Ala Trp Tyr Lys Ser Met Glu Tyr Arg Ser Arg 
H5 120 125 



Val Val Ala Asp Pro Arg Gly Val Leu Lys Arg Asp Phe Gly Phe Asp 
130 135 140 



9/15 



AOK1, ct al „Q64574 

NOVEL RHODOCOCCUS BACTERIUM, NTTRELASE GENE, 
NrrRYL^KDRATASE GENE AND AMI DAS E GENE FROM 
RHOC^^KUS BACTERIUM, AND PROCESS FOR PRODUCING 
CAR^^^K ACIDS USING THEM 
Filed: W, 2001 

Bruce E. Kramer - ^..202-293-7000 

10 of 15 Sequence Listing 



lie Pro Asp Glu Val Glu Val Arg Val Trj? Asp'Ser Ser Ser Glu lie 
145 150 155 160 



Arg Tyr lie Val He Pro Glu Arg Pro Ala Gly Thr Asp Gly Trp Ser 
165 170 175 



Glu Asp Glu Leu Ala Lys Leu Val Ser Arg Asp Ser Met He Gly Val 
180 185 190 



Ser Asn Ala Leu Thr Pro Gin Glu Val He Val 
195 200 



<210> 6 

<211> 2822 

<212> DNA 

<213> Rhodococcus sp . 
<220> 

<221> misc_f eature 

<222> (556) . . (556) 

<223> "n" may be a, c, g or t 



<220> 

<221> mi sc_f eature 

<222> (592) . . (592) 

<223> n n" may be a, c, g or t 



<220> 

<221> misc_feature 

<222> (896) . . (896) 

<223> "n" may be a, c, g or t 



<220> 

<221> CDS 

<222> (1094) . . (2491) 

<223> amidase 



<400> 6 
tgattacgaa 


ttcgagctcg 


gtacccgggg 


ateacttegg 


ccagagggtg 


aeggegaaat 


cgggcctcga 


tctccgcgtc 


cacggcgttg 


atacgtgtgt 


cgaggtcgat 


caccgcctgc 


gecaattegg 


cgaccagttc 


ggcagegaca 


tcttcccccg 


gcaaccgcac 


ggtctgcgcc 


ttcgeggegg 


tgactgegge 


ccgggcgatc 


gatteggegt 


ggcgcacccc 


ggccccggtg 


ageattgegg 


ccagtcgggc 


cgccccgacg 


eggeggateg 


ettteggteg 


ctggtagcgg 


gccagcagca 


ccacccagcc 


ccggtccgag 


gagatctgeg 


cgacgcgttc 


gagtcegggg 



60 
120 
180 
240 
300 
360 
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"WAiut AL-lUb USING THEM 
i J^^6, 2001 

-202-293-7060 

f ^^^B Sequence Listing ^^^V 



cagatcgega 


cgagttgctg 


acgcagccgg 


ttgatggtcc 


gggtacggtc 


ggegaccaga 


420 


tcggtgcggt 


ggccggtgag 


catctgcagc 


tcac^gatca 


'actcgtcgtc 


gggaegcaga 


480 


aegggcaggt 


ccgaccgcat 


ccgggactga 


teggegatea 


cccgggcgtc 


gegggegteg 


540 


gtcttggctt 


cgccgncgcg 


gtagaccgac 


gatgcctgcc 


acaccgaacg 


tneggacagg 


600 


tagcgcaccg 


gtttcccggc 


gtcggccagc 


acagtcagca 


acaaggtgac 


gtaggcggtg 


660 


gtcagatcca 


ccgtccacga 


caccgtctcg 


gtgagtgcgt 


egatcteggt 


gatcaccgca 


720 


cggatcgttg 


ettegtegtt 


gtcacatcgc 


cgcgacagca 


ccaccgtccc 


ggaggtgtcg 


780 


agtaegcata 


tccagtggtg 


ttctttgccg 


acgtcgactc 


ctgcccacag 


ttgcgaaccg 


840 


gtcateggat 


ttcctcgttt 


tcgcttgtgt 


tccggcctgg 


ccccgatgga 


cgcctncgcc 


900 


ggcatttcct 


taaacaagcg 


ateatgegea 


gatctcaatc 


ageggtccag 


aggtgtccag 


960 


acaggteggg 


tggccagtcc 


tttcaagccc 


cactcgagag 


tgggcaaacc 


ttatgeagee 


1020 


tcgccggcct 


gecegggtta 


cagctcaacg 


taactctcac 


gaagtaactg 


cacctacgaa 


1080 


cttaaggaac 


etc atg tct teg ttg act ccc ccc aat tec aac 


caa atg 


1129 



Met Ser Ser Leu Thr Pro Pro Asn Ser Asn Gin Met 
15 10 

teg gee ctg aac aac cac ttc cga ttc gga ctg acg acg ccg gaa etc 1177 
Ser Ala Leu Asn Asn His Phe Arg Phe Gly Leu Thr Thr Pro Glu Leu 
15 20 25 

gaa gag ttc gca ccg gee etc gaa gcg acg etc gcg tec tec gaa ace 1225 
Glu Glu Phe Ala Pro Ala Leu Glu Ala Thr Leu Ala Ser Ser Glu Thr 
30 35 40 

gtc gaa cgc etc tac gag cgc acc gcg ccc gag ccg cct cag egg tea 1273 
Val Glu Arg Leu Tyr Glu Arg Thr Ala Pro Glu Pro Pro Gin Arg Ser 
45 50 55 60 

tgg acc tea ccc acg gcg gac gag aac ccg ctg age gee tgg tac gtc 1321 
Trp Thr Ser Pro Thr Ala Asp Glu Asn Pro Leu Ser Ala Trp Tyr Val 
65 70 75 

acc acc teg ate age gaa acc gac gaa ggc ccc etc gee ggg cga acg 13 69 

Thr Thr Ser lie Ser Glu Thr Asp Glu Gly Pro Leu Ala Gly Arg Thr 
80 85 90 

gtc gee gtg aaa gac aac gtc gca gtc gee ggc gtg ccg atg atg aac 1417 
Val Ala Val Lys Asp Asn Val Ala Val Ala Gly Val Pro Met Met Asn 
95 100 105 

ggc tec cga acc gtc gag ggc ttc acc ccc cgc tac gac gee acc gtc 14 65 

Gly Ser Arg Thr Val Glu Gly Phe Thr Pro Arg Tyr Asp Ala Thr Val 
110 115 120 

gta cgc cga ctg etc gac gee ggc gca acc ate acc ggc aaa gcg gtg 1513 
Val Arg Arg Leu Leu Asp Ala Gly Ala Thr lie Thr Gly Lys Ala Val 
125 130 135 140* 
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NITRYL HYDRATASE GENE AND ANHDaSITGENE FROM 
RHODOCOCCUS BACTERIUM, AND PROCESS FOR PRODUCING 
CARBOXYUC ACIDS USING THEM 
Filed: Jux^^2001 

Brace E.^^^ 202-293-7060 

12ofl5^^^H Sequence Listing 



tgc gaa gat etc tgc ttc tec ggc gee age ttc act tec cac ccc cag 1561 
Cys Glu Asp Leu Cys Phe Ser Gly Ala Ser Phe Thr Ser His Pro Gin 
145 l5o 1 155 

ccg gtc cgc aac ccc tgg gac gaa age cgc ate acc ggc ggc teg tec 1609 
Pro Val Arg Asn Pro Trp Asp Glu Ser Arg lie Thr Gly Gly Ser Ser 
160 165 170 

age ggc age ggc gec ctg gtc gec age ggc cag gtg gat atg gca gtc 1657 
Ser Gly Ser Gly Ala Leu Val Ala Ser Gly Gin Val Asp Met Ala Val 
175 180 185 

ggc ggc gac cag ggc ggt teg ate cgc ate ccc gee gcg ttc tgc ggc 1705 
Gly Gly Asp Gin Gly Gly Ser lie Arg lie Pro Ala Ala Phe Cys Gly 
190 195 200 

ate gtc gga cac aaa ccc acc cac gga ctg gtc ccc tat acg gga gca 1753 
lie Val Gly His Lys Pro Thr His Gly Leu Val Pro Tyr Thr Gly Ala 
205 210 215 220 

ttt ccc ate gaa cga acc ate gac cac etc ggt ccg atg acg cgc acg 1801 
Phe Pro lie Glu Arg Thr lie Asp His Leu Gly Pro Met Thr Arg Thr 
225 230 235 

gtc age gac gee gee gca atg etc acc gtc etc gec ggc acc gac ggc 1849 
Val Ser Asp Ala Ala Ala Met Leu Thr Val Leu Ala Gly Thr Asp Gly 
240 245 250 

etc gat ccc cga cag acc cac egg ate gaa ccg gtg gac tac etc gcg 1897 
Leu Asp Pro Arg Gin Thr His Arg lie Glu Pro Val Asp Tyr Leu Ala 
255 260 265 

gcg ctg gec gaa ccc gca teg ggt ctg cgc gtg ggt gtg gtc acc gaa 1945 
Ala Leu Ala Glu Pro Ala Ser Gly Leu Arg Val Gly Val Val Thr Glu 
270 275 280 



ggc ttc gac acc cct gtc tec gac get gec gtc gac aat gee gtg cgc 
Gly Phe Asp Thr Pro Val Ser Asp Ala Ala Val Asp Asn Ala Val Arg 
285 290 295 300 



1993 



acc gee ate ggc gta ctg cgc teg gec gga ctt acc gtc gaa gag gtc 2 041 

Thr Ala lie Gly Val Leu Arg Ser Ala Gly Leu Thr Val Glu Glu Val 
305 310 315 

teg ate ccc tgg cac etc gat gcg atg gec gtc tgg aac gtg ate gac 2089 
Ser lie Pro Trp His Leu Asp Ala Met Ala Val Trp Asn Val lie Asp 
320 325 330 

egg gec gac gac gaa ttc gaa gee ttc ctg ctg cag gtg etc gac gag 2137 
Arg Ala Asp Asp Glu Phe Glu Ala Phe Leu Leu Gin Val Leu Asp Glu 
335 340 345 

aac gec gtc acc ate ccc gaa etc gga cag gtg egg gcg cag acg ccg 2185 
Asn Ala Val Thr lie Pro Glu Leu Gly Gin Val Arg Ala Gin Thr Pro 
350 355 360 

cgc teg tgg tgc tea cct cga acc gca ccc gcg agg tgc acg acg ccc 2233 
Arg Ser Trp Cys Ser Pro Arg Thr Ala Pro Ala Arg Cys Thr Thr Pro 
365 370 375 380 
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* - . «v « ^ n , i ASt O'hNfi AND AMTOASE GENE FROM 

CAR^v??r!n,^l^ mi - ^ PR0CESS F0R PRODUCING 
CARBOXYUC ACIDS USING THEM 
Filed: Jun^^2G01 

f^™--- 20 £? s -™- 

^^^V Sequence Listing 



tea aac gec get gee tgt ace act ggc teg aac acc ccg acc teg cgc 2281 
Ser Asn Ala Ala Ala Cys Thr Thr Gly Ser Asn Thr Pro Thr Ser Arg 
385 3^0 * 395 

ggg aag tgg aga tec tgc gee gee gca tec egg gca teg acg aac acc 232 9 

Gly Lys Trp Arg Ser Cys Ala Ala Ala Ser Arg Ala Ser Thr Asn Thr 
400 405 410 

teg egg cgc agg teg ccc acg ccg tgc agg cca tgc gcg gga tgg acc 23 77 

Ser Arg Arg Arg Ser Pro Thr Pro Cys Arg Pro Cys Ala Gly Trp Thr 
415 420 425 

tgc tea aac cac ccg ggg teg egg agt cgc tgg act ggg cac gag cgc 2425 
Cys Ser Asn His Pro Gly Ser Arg Ser Arg Trp Thr Gly His Glu Arg 
430 435 440 

fc 9 c 999 aac tc 9 acc 9 C 9 ac 9 t9 c tc 9 a ^g cga cga ccg egg ccg cga 24 73 

Cys Gly Asn Ser Thr Ala Thr Cys Ser Thr Arg Arg Pro Arg Pro Arg 
445 450 455 460 

ccc teg gtg ccg tec tga agtaccggga ggacctcgac cgagtggtcc 2521 
Pro Ser Val Pro Ser 
465 

gcaccgggct cgaccggctc ctgacggggt gaeageggeg atgacgacga ccaccgacgc 2581 

egggggttec ctcgtcggac tcaccggctt cacccgcgcc ctcgccgcgg ccggcctgtc 2641 

cgtcgcctcg gacgccaccg tggcctacct gcgcgccctg cgegagateg acctcggcga 2701 

ccgccgtcag gtgtactggg ccgggcgcgc caccctgtgc cacgaccccg acgacatccc 2761 

ccgctacgac ctcgcgttcg agagctggtt eggeggaacg gcacccgacg tgacgtcgcc 2821 

g 2822 



<210> 7 

<211> 465 

<212> PRT 

<213> Rhodococcus sp. 
<220> 

<221> misc_f eature 

<222> (556) . . (556) 

<22 3> "n" may be a, c, g or t 

<220> 

<22 1> mi sc_f eature 

<222> (592) . . (592) 

<223> "n" may be a, c, g or t 

<220> 

<221> misc_feature 

<222> (896) . . (896) 

<22 3> "n" may be a, c, g or t 

<400> 7 



Met Ser Ser Leu Thr Pro Pro Asn Ser Asn Gin Met Ser Ala Leu Asn 
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KITRYL HYDRATASE GENE AND AMI DASH GhNt hKUM 
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14 of^^^B Sequence Listing 



10 15 



Asn His Phe Arg Phe Gly Leu Thr Thr Pro Glu Leu Glu Glu Phe Ala 
20 25 30 



Pro Ala Leu Glu Ala Thr Leu Ala Ser Ser Glu Thr Val Glu Arg Leu 
35 40 45 



Tyr Glu Arg Thr Ala Pro Glu Pro Pro Gin Arg Ser Trp Thr Ser Pro 
50 55 60 



Thr Ala Asp Glu Asn Pro Leu Ser Ala Trp Tyr Val Thr Thr Ser lie 
65 70 75 80 



Ser Glu Thr Asp Glu Gly Pro Leu Ala Gly Arg Thr Val Ala Val Lys 
85 90 95 



Asp Asn Val Ala Val Ala Gly Val Pro Met Met Asn Gly Ser Arg Thr 
100 105 110 



Val Glu Gly Phe Thr Pro Arg Tyr Asp Ala Thr Val Val Arg Arg Leu 
115 120 125 



Leu Asp Ala Gly Ala Thr lie Thr Gly Lys Ala Val Cys Glu Asp Leu 
130 135 140 



Cys Phe Ser Gly Ala Ser Phe Thr Ser His Pro Gin Pro Val Arg Asn 
145 150 155 160 



Pro Trp Asp Glu Ser Arg lie Thr Gly Gly Ser Ser Ser Gly Ser Gly 
165 170 175 



Ala Leu Val Ala Ser Gly Gin Val Asp Met Ala Val Gly Gly Asp Gin 
180 185 190 



Gly Gly Ser lie Arg He Pro Ala Ala Phe Cys Gly He Val Gly His 
195 200 205 



Lys Pro Thr His Gly Leu Val Pro Tyr Thr Gly Ala Phe Pro He Glu 
210 215 220 



Arg Thr He Asp His Leu Gly Pro Met Thr Arg Thr Val Ser Asp Ala 
225 230 235 240 



Ala Ala Met Leu Thr Val Leu Ala Gly Thr Asp Gly Leu Asp Pro Arg 
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RHODOOWre'^^;^*"" AM1DASE GENE FROM 

^SSS^^X^S PROCESS F0R PRODVC ™ 

Filed: June 26, 2001 

E. Kramer...., 



5 



-202-293-7060 



Sequence Listing 

245 250 255 



Gin Thr His Arg lie Glu Pro Val Asp Tyr Leu Ala Ala Leu Ala Glu 
260 265 270 



Pro Ala Ser Gly Leu Arg Val Gly Val Val Thr Glu Gly Phe Asp Thr 
275 280 285 



Pro Val Ser Asp Ala Ala Val Asp Asn Ala Val Arg Thr Ala lie Gly 
290 295 300 



Val Leu Arg Ser Ala Gly Leu Thr Val Glu Glu Val Ser lie Pro Trp 
305 310 315 320 



His Leu Asp Ala Met Ala Val Trp Asn Val lie Asp Arg Ala Asp Asp 
325 330 335 



Glu Phe Glu Ala Phe Leu Leu Gin Val Leu Asp Glu Asn Ala Val Thr 
340 345 350 



lie Pro Glu Leu Gly Gin Val Arg Ala Gin Thr Pro Arg Ser Trp Cys 
355 360 365 



Ser Pro Arg Thr Ala Pro Ala Arg Cys Thr Thr Pro Ser Asn Ala Ala 
370 375 380 



Ala Cys Thr Thr Gly Ser Asn Thr Pro Thr Ser Arg Gly Lys Trp Arg 
385 390 395 400 



Ser Cys Ala Ala Ala Ser Arg Ala Ser Thr Asn Thr Ser Arg Arg Arg 
405 410 415 



Ser Pro Thr Pro Cys Arg Pro Cys Ala Gly Trp Thr Cys Ser Asn His 
420 425 430 



Pro Gly Ser Arg Ser Arg Trp Thr Gly His Glu Arg Cys Gly Asn Ser 
435 440 445 



Thr Ala Thr Cys Ser Thr Arg Arg Pro Arg Pro Arg Pro Ser Val Pro 
450 455 460 



Ser 
465 
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